
14 research outputs found

Latent Emission-Augmented Perspective-Taking (LEAPT) for Human-Robot Interaction

Author: Chen Kaiqi
Kuan Kingsley
Lim Jing Yu
Soh Harold
Publication date: 12/08/2023

Perspective-taking is the ability to perceive or understand a situation or concept from another individual's point of view, and is crucial in daily human interactions. Enabling robots to perform perspective-taking remains an unsolved problem; existing approaches that use deterministic or handcrafted methods are unable to accurately account for uncertainty in partially-observable settings. This work proposes to address this limitation via a deep world model that enables a robot to perform both perceptual and conceptual perspective-taking, i.e., the robot is able to infer what a human sees and believes. The key innovation is a decomposed multi-modal latent state space model able to generate and augment fictitious observations/emissions. Optimizing the ELBO that arises from this probabilistic graphical model enables the learning of uncertainty in latent space, which facilitates uncertainty estimation from high-dimensional observations. We tasked our model with predicting human observations and beliefs on three partially-observable HRI tasks. Experiments show that our method significantly outperforms existing baselines and is able to infer the visual observations available to other agents and their internal beliefs.

arXiv.org e-Print Archive

Object detection meets knowledge graphs

Author: CHANDRASEKHAR Vijay
FANG Yuan
KUAN Kingsley
LIN Jie
TAN Cheston
Publication venue: 'International Joint Conferences on Artificial Intelligence'
Publication date: 01/08/2017

Institutional Knowledge at Singapore Management University

Truly multi-modal YouTube-8M video classification with video, audio, and text

Author: FANG Yuan
KUAN Kingsley
MANEK Gaurav
RAVANT Mathieu
SONG Sibo
WANG Zhe
et al.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/07/2017

Institutional Knowledge at Singapore Management University

Region average pooling for context-aware object detection

Author: CHANDRASEKHAR Vijay
FANG Yuan
KUAN Kingsley
LIN Jie
MANEK Gaurav
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 20/08/2017